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Abstract. We consider the links between Ramsey tiieory in the integers, 
based on van der Waerden's theorem, and (boolean, CNF) SAT solv- 
ing. We aim at using the problems from exact Ramsey theory, concerned 
with computing Ramsey-type numbers, as a rich source of test problems, 
where especially methods for solving hard problems can be developed. We 
start our investigations here by reviewing the known van der Waerden 
numbers, and we discuss directions in the parameter space where possi- 
bly the growth of van der Waerden numbers vdwm(fci, . . . , km) is only 
polynomial (this is important for obtaining feasible problem instances). 
We introduce transversal extensions as a natural way of constructing 
mixed parameter tuples (fci, . . . , km) for van-der- Waerden- like numbers 
N(A;i, . . . , km), and we show that the growth of the associated numbers is 
guaranteed to be linear. Based on Green- Tao's theorem ( "the primes con- 
tain arbitrarily long arithmetic progressions") we introduce the Green- 
Tao numbers grt^(A:i, . . . , km), which in a sense combine the strict struc- 
ture of van der Waerden problems with the (pseudo-)randomness of the 
distribution of prime numbers. Using standard SAT solvers (look-ahead, 
conflict-driven, and local search) we determine the basic values. It turns 
out that already for this single form of Ramsey-type problems, when 
considering the best-performing solvers a wide variety of solver types is 
covered. For m > 2 the problems are non-boolean, and we introduce 
the generic translation scheme, which offers an infinite variety of trans- 
lations ("encodings") and covers the known methods. In most cases the 
special instance called nested translation proved to be far superior over 
its competitors (including the direct translation). 



1 Introduction 



The applicability of SAT solvers has made tremendous progress over the last 15 
years; see the recent handbook[B[. We are concerned here with solving (concrete) 
combinatorial problems (see p3| for an overview). Especially we are concerned 
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with the computation of van-der-Waerden-like numbers, which is about colouring 
hypergraphs of arithmetic progressions 

An arithmetic progression of size fc G Nq in N is a set P C N of size k such 
that after ordering (in the natural order) , two neighbours always have the same 
distance. So the arithmetic progressions of size fc > 1 are the sets of the form 
P = {a + i ■ d : i G {0, . . . , fc — 1}} for a, d e N. Van der Waerden's Theorem 
(Q) shows that whenever the set N of natural numbers is partitioned into 
finitely many parts, some part must contain arithmetic progressions of arbitrary 
size. The finite version, which is equivalent to the above infinite version, says 
that for every progression size fc G N and every number to G N of parts there 
exists some ng G N such that for n > tiq every partitioning of {1, . . . ,n} into 
TO parts has some part which contains an arithmetic progression of size fc. The 
smallest such uq is denoted by vdwm{k), and is called a vdW-number. The 
subfield of Ramsey theory concerned with van der Waerden's theorem is for over 
70 years now an active field of mathematics and combinatorics; for an elementary 
introduction see . 

We are concerned here with exact Ramsey theory, that is, computing vdW- 
like numbers if possible, or otherwise producing (concrete) lower bounds. 
introduced the application of SAT for computing vdW-numbers, showing that 
all known vdW-numbers (at that time) were rather easily computable with SAT 
solvers. With Q yet SAT had its biggest success, computing the new (major) 
vdW- number vdw2(6) = 1132 (the problem of computing vdw2(6) is mentioned 
in ||2^ as a difficult research problem). See for the current state-of-the-art. 
Regarding lower bounds, the best lower bounds currently one finds in |ll|] .p^ 

VdW-numbers for "core" parameter values (see Definition ||) grow rapidly, 
and thus only few are known (see Section The first contribution of this article 
is the notion of a transversal extension (see Definition |2|) of a parameter tuple, 
which allows to grow parameter tuples such that (only) linear growth of the 
associated vdW-numbers is guaranteed. The linear growth is proven in a general 
framework in Theorem 10, and applied to vdW- numbers in Corollary [Tl|. 



Next we introduce Green- Tao numbers ( "GT- numbers" ; see Definition [l3| ), 
which are defined as the vdW-numbers but using the first n prime numbers 
instead of the first n natural numbers. The existence of these numbers is given 
by the celebrated Green- Tao Theorem (|^). In Corollary we show that also for 
GT-numbers transversal extension numbers grow only linearly. In the remainder 
of the article we are concerned with computing GT-numbers. 

For binary parameter tuples (to = 2 above) the problems of computing vdW- 
or GT-numbers have a canonical translation to (boolean) SAT problems, while 
for TO > 2 we still have a canonical translation into non-boolean SAT problems 



Thi s report is an extended version of pl|]. 

see ittp: //www . St . ewi . tudelf t .nl/sat/waerden.php for updates 
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(as is the case in general for hypergraph colouring problems; see [^), but for 
using standard (boolean) SAT solvers the problem of a boolean translation arises. 
In Section ^ we introduce the generic translation scheme, with seven natural 
instances (amongst them the well-known direct and logarithmic translations). 
As it turns out, in nearly all cases for all solver types the weak nested translation 
(introduced in (^) performed far best, with the only exception that for relatively 
large numbers of colours the logarithmic translation was better. 

For this (initial) phase of investigations into GT-numbers we just used "ofF- 
the-shelves" SAT solvers, also aiming at some form of basic understanding why 
which type of solver is best on certain parameter ranges. For over one year 
on average 10 processors were running, with a lot of manual interaction and 
adjustment to find the right solvers and translations, and to set the parameters 
(most basic the number of vertices), establishing the basic Green- Tao numbers. 
All generators and the details of the computations are available in the open- 
source research platform OKlibrary (see [0).0 See Section |5] for the results 
of these computations. We conclude this article by a discussion of interesting 
research directions in Section |^. 

2 A few notions and notations 

We use No = Z>o and N = No \ {0}. A finite hypergraph G is a pair G = {V, E) 
where is a finite set and E C WiV) (that is, E is set of subsets of V)\ we 
use V{G) := V and E{G) := E. An m-colouring of a hypergraph G is a map 
/ : V{G) {1, ■ . ■ such that no hyperedge is monochromatic, that is, for 
every H G E{G) there are v,w ^ H with f{v) ^ f{w). Regarding (boolean) 
clause-sets, complementation of boolean variables v is denoted by v, (boolean) 
clauses are finite and clash- free sets of (boolean) literals, and (boolean) clause- 
sets are finite sets of (boolean) clauses. 

3 The theorem of Green- Tao, and Green- Tao numbers 

The numbers vdwm(fc) introduced in Section |^ are "diagonal vdW-numbers" , 
while we consider also the "non-diagonal" or mixed vdW-numhers, which are 
defined as follows. 

Definition 1. A parameter tuple is an element o/N>2 for some m S N which 
is monotonically non- decreasing (that is, sorted in non- decreasing order). For 
a parameter tuple (fci, . . . , fc„j) the vdW-number vdWj„(A;i, . . . , km) is the 

smallest no S N such that for every n > uq and every f : {!,..., m} 

there exists some "colour" (or "part") i € {1, . . . ,m} such that f^^{i) contains 
an arithmetic progression of size ki . 

tittp : //www . ok-sat-library . org 
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In a systematic study of parameter tuples and their operations, one likely should 
drop the sorting condition, and call our parameter tuples "sorted" , however in 
this report we only consider sorted parameter tuples. 

Obviously we have vdwm(fci, ■ • ■ , km) < vdwm (max(A:i , . . . , km)) (note that 
the right-hand side denotes a diagonal vdW- number), and thus also the mixed 
vdW- numbers exist (are always finite). The most up-to-date collection of precise 
(mixed) vdW- numbers one finds in 1^ For the sake of completeness we state 
the numbers here, but for references we refer to [|l]j2j. We introduce the following 
organisation of the parameter space. 

Definition 2. A parameter tuple is trivial if all entries are equal to 2, otherwise 
it is non-trivial. A simple parameter tuple has length 1, otherwise it is non- 
simple. A parameter tuple is a core tuple if it is non-simple and if all entries 
are greater than or equal to 3. A parameter tuple t is a (transversal) extension 
of a parameter tuple t' if t can be obtained from t' by adding entries equal to 2 to 
the front oft'. A transversal extension of a simple parameter tuple is called an 
extended simple tuple or a transversal tuple, while an transversal extension 
of a core tuple is called an extended core tuple. Finally a parameter tuple is 
diagonal, if it is constant (all entries are equal), while otherwise it is non- 
diagonal or mixed. 

Accordingly we speak of (and are interested in) trivial vdW-numbers, sim- 
ple vdW-numbers, core vdW-numbers, transversal vdW-numbers, ex- 
tended core vdW-numbers, and diagonal vdW-numbers. 

The trivial vdW- numbers are vdwm(2) — m + 1, while the simple vdW- numbers 
are given by vdwi(fc) = k. The known core vdW-numbers are as follows. 

1. 24 binary core vdW- numbers vdw2(a, b) are known: 





3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


14 


15 


16 


17 


18 


3 


9 


18 


22 


32 


46 


58 


77 


97 


114 


135 


160 


186 


218 


238 


279 


312 


4 




35 


55 


73 


109 


146 






















5 






178 


206 


























6 








1132 



























2. 4 core vdW-numbers vdw3(a, b, c) and one core vdW-number vdw4(a, 6, c, d) 
are known: 



a,b ^ 


3 4 5 


3,3 


27 51 80 


3,4 


- 89 



h d 

a, 0, c 


3 


3,3,3 


76 



The (updated) online-version is ittp : //users . encs . concordia. ca/~ta_ahined/ 
vdw . html . 
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A basic quest for this article is in what "directions" can one move through the 
parameter space while experiencing only polynomial growth? Regarding vdW- 
numbers we consider the conjecture (perhaps better called "question"), that 
extended core parameter tuples grow only polynomially in the extension length, 
where for parameter tuples a, 6 by a; 6 we denote their concatenation; 

Conjecture 3. For every (fixed) parameter tuple t of length m the map fc G N 
vdw„i_|_i(/;; (fc)) is polynomially bounded in k (depending on t). 

In other words, for the above tables and all similarly constructed tables growth in 
every row is polynomially bounded. The evidence for Conjecture is as follows. 



1. 



The case t = (3), and more precisely vdw2(3,fc) < fc^, has been suggested 
in The numbers vdw2(3, /c) are known for 1 < fc < 18 (see above). 
Additionally, our experiments yield the following conjectured values (where 
using "> a;" means that we believe that actually equality holds, while the 
lower bound "> x — 1" has been shown), further supporting the conjectured 
upper boundQ vdw2(3, 19) > 349, vdw2(3, 20) > 389, vdw2(3, 21) > 416. 

2. Considering t = (4), the numbers vdw2(4, fc) are known for 1 < fc < 8 (see 
above), while the bound vdw2(4, 9) > 254 is in [0; we can improve this to 
vdw2(4, 9) > 309, and furthermore vdw2(4, 10) > 328. So going from fc = 8 
to fc = 9 we see a rather big jump, however possibly from fc = 9 to fc = 10 
only a small change might take place. 

3. For general t — (fco) with fco > 3, in Q the lower bound vdw2(fco,fc) > 
^fco-i-iog(iog(fc)) Jqj. sufficiently large fc has been shown. It seems consis- 
tent with current knowledge that we could have vdw2(fco,fc) < k^°~^ for 
all fc, fco > 1. 

A contribution of this article is the systematic consideration of transversal ex- 
tensions as defined in Definition ^. The known 33 -|- 10 -I- 1 -f 6 = 50 extended 
core vdW- numbers are as follows (again, for references see ^ ) ; transversal vdW- 
numbers are presented in Section A.l; see Subsection 3.3 for general remarks. 



1. Extending (3, fc) by m 2's, i.e., the numbers vdwm+2(2, . . . , 2, 3, fc): 



fc 

m 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 





9 


18 


22 


32 


46 


58 


77 


97 


114 


135 


160 


1 


14 


21 


32 


40 


55 


72 


90 


108 


129 


150 


171 


2 


17 


25 


43 


48 


65 


83 


99 


119 








3 


20 


29 


44 


56 


72 


88 












4 


21 


33 


50 


60 
















5 


24 


36 




















6 


25 






















7 


28 























All lower bounds are obtained by local-search algorithms from the Ubcsat-suite (see 
|3l|]), and all data is available through the DKlibrary. 
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2. Extending (4, fc) resp. (5, fc) by m 2's, i.e., numbers vdwm+2(2, . . . , 2, 4, fc) 
resp. vdw„+2(2, • . • , 2, 5, k): 



k 

m 


4 


5 


6 


7 8 





35 


55 


73 


109 146 


1 


40 


71 


83 


119 


2 


53 


75 


93 




3 


54 


79 






4 


56 









k 

m 


5 





178 


1 


180 



3. Extending (3, 3, fc) by m 2's, i.e., numbers vdwm_|_3(2, . . . , 2, 3, 3, fc): 



fc 

m 


3 


4 


5 





27 


51 


80 


1 


40 


60 


86 


2 


41 


63 




3 


42 







Note that by Conjecture ^ we would have in every row only polynomial growth. 
Now in Corollary ^ we will prove that in every column we have linear growth, 
where actually the factor can be made as close to 1 as one wishes, when only m 
is big enough. 



3.1 A general perspective on Ramsey theory 

We consider a sequence (G„)„gN of finite hypergraphs, where we assume that 
we have V{Gn) ^ V{Gn+i) and E{Gn) Q E{Gn+i) for all n. Furthermore we 
assume V{Gi) ^ and Vn G N : ^ E{Gn) for simplicity. Such a sequence of 
hypergraphs we call nontrivial monotonic. We consider the following questions: 

(i) Does there exist some n e N with E{Gn) / ? 

(ii) Does for every m £ N exists some no(m) S N such that for all n > no the 
hypergraph G„ is not m-colourable? In this case we say that (G„)„gN has 
the Ramsey property. 

(iii) Does lim„_^oo Jv^^ — ^ hold, where a{G) for a hypergraph G is the in- 
dependence number of G, the maximum size of an independent vertex set 
(not containing any hyperedge)? In this case we say that (G„)„gN has the 
Szemeredi property. 

Clearly (ii) implies (i), while in turn (iii) implies (ii), since colouring a hyper- 
graph G with m colours just means to partition V{G) into at most m indepen- 
dent subsets. Considering the original vdW-problem, we have G„ = ap(fc, n) for 
some fixed fc £ N, where y(ap(fc, n)) — {1, . . . , n}, while £'(ap(fc, n)) is the set 
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of arithmetic progressions of size k in {!,..., n}. Property (i) trivially holds, 
while property (ii) is van der Waerden's theorem. And property (iii) has been 
conjectured by Erdos and Turan in 1936 (§), and was finally proved by Sze- 
meredi in his landmark paper (for arbitrary fc, one of the deepest results in 
combinatorics; for fc = 3 it was proven in ||2^, for fc = 4 in [p9|). 

Consider a nontrivial monotonic sequence G = (G„)„gN of hypergraphs. The 
following definition generalises diagonal vdW-numbers, and introduces a form of 
"convergence rate" capturing the Szemeredi property. 

Definition 4. For m e N let N^(G) e N U {+00} be the infimum 0/ n S N 
such that Gn is not m-colourable. And for q £ IR.>o let cr(G, q) £ N U {+00} be 
the infimum 0/ n G N such that for all n' > n holds \y^^"'^-^ \ < <?• 

Thus G has the Ramsey property iff for all m e N we have Nm(G') < +00, 
while G has the Szemeredi property iff for all q e]0, 1] we have cr(G,q) < 
+00. Considering the sequence (ap(fc,n))„gN of vdW- hypergraphs of arithmetic 
progressions of size k we have Nm(ap(fc, — )) = vdwm{k). The following simple 
fact makes the above remark, that (iii) implies (ii), more precise. 

Lemma 5. For all m E N we have N,„(G') < cr(G, 

We say that hypergraph sequences , . . . , G™ are compatible if for all n we have 
y(G^) = ■ ■ ■ — V^(G™). Generalising the notion of "diagonal vdW-like numbers" 
in Definition ^ and the notion of mixed vdW-numbers in Definition |l|: 

Definition 6. Consider m 6 N and compatible nontrivial monotonic hypergraph 
sequences G\ . . . , G™. Then N^(G\ . . . , G"") e N U {+00} is defined as the 
infimum 0/ n G N such that for every m-colouring of y(G^) there exists some 
i e {1, . . . , m} such that some hyperedge of G^ is monochromatically i-coloured. 

Obviously we have N™(G) = N„(G, ...,G). Cah (G\...,G") horizontally 
monotonic if for all n G N and all 1 < i < j < m every independent subset 
of G\ is also independent in G;^. In this case then N„i(G^, . . . , G™) < N„i(Gm) 
holds. This captures the typical application of "mixed numbers" from Ramsey 
theory. Generalising the notion of "convergence rate" in Definition ^: 

Definition 7. Consider m £ N and compatible nontrivial monotonic hypergraph 
sequences G\ . . . , G™. For q G R>o let cr((Gi, . . . , Gm), g) G N U {+00} be 
the infimum o/n G N such that for all n' > n and for all m-tuples (5i, . . . , Sm) 
of (pairwise) disjoint independent subsets Si of G\, we have < 

A few basic remarks: 



1. cr(G,g) = cr((G),(z). 
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2. cr((G'\ . . . , G™), q) < CT{Gm, ^) if (G\ ■ • ■ , G") is horizontally monotonic. 

3. By definition we have for arbitrary compatible nontrivial monotonic hyper- 
graph sequences that N„(G\ . . . , G") = cr((G\ . . . , G™), 1). By Remark |) 
this generalises Lemma ^. 

For complete hypergraphs we can easily establish the Szemeredi property: 

Lemma 8. For n,k G N let Vj^ be the hypergraph with vertex set {1, . . . ,n} and 
hyperedge set (^^' j.'"'^) • Now for natural numbers fci, . . . , k„i and q > we have 
that ct((V'!^^ , . . . ,V_'^"^),q) is the smallest n > ^^'=^^'°''* — —. Especially we have 
cr{{V^, . . . ,V'^),q) = cr(y'"+\g), which is the smallest n > ^. 

By definition we get the following generalisation of Remark |^ to Definition |^: 

Lemma 9. For m G N consider compatible nontrivial monotonic hypergraph 
sequences G^, . . . , G™, and consider 1 < t < m. Then for p,q E M>o we have 

cr((G\ . . . , G'"),p + q) < max (cr((G\ . . . , G*),p), cr((G*+\ . . . , G"), q)). 

Lemma [s] and Lemma ^ (splitting 1 7 + together yield the basic 

theoretical observation of this paper: 

Theorem 10. Consider I £ N and compatible nontrivial monotonic hypergraph 
sequences G\ . . . ,G'. For n e N let Vn := ^(G^), and for k € N let := 
{Vn, (^")); 0''^d, thus = {Qn)neN is a nontrivial monotonic hypergraph se- 
quence. For X GM. let M{x) G N U {+cx)} be the infimum of n GN such that we 
have \Vn\ > x. Now for every s G K with s > 1 and for every m G Nq we have 

Ni+,n{G\ . . . , G', . . . , Q2) ^ N;+i(G\ . . . , G', Q"+i) < 

max (M(s-m), cr ((G\ . . . , G'), 1 - i)). 

So the growth-rate of m i— >■ Ni+m(G^, . . . , G', Q^, . . . , Q^) is linear for m large 
enough, where the factor can be made arbitrarily close to 1. Applied to vdW- 
numbers, using Szemeredi's theorem, we get the following application (the proof- 
idea here originated from Jan-Christoph Schlage-Puchta). As a special case of 
Definition ^ we use crap(i,q) for parameter tuples t, using the hypergraph se- 
quences belonging to the progression sizes in t. 

Corollary 11. For a parameter tuple t of length I G N, for ni G No and for 
s G M>i we have vdw,„+;((2, . . . , 2); i) < max(s • m + 1, crap(i, 1 — -j))- 

Giving up on the factor, but now without unknown minimal value for m, we 
have the following variation on Corollary fl[ 
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Lemma 12. For a parameter tuple t of length I G N and for m G Nq we have 
vdw„+;((2,...,2);i) < (m + l) -vdw^t). 

Proof. Let n :— (to + 1) • vdwi(t). Now for any S C {1, . . . , n} with jS*! < to the 
set {1, . . . , n} \ 5 contains at least one interval {i, . . . , j} for 1 < i < j < n with 
_;/ — « + 1 = vdw/(t). Using the invariance of linear progressions under translation, 
we obtain the desired inequality. □ 



3.2 Arithmetic progressions in the prime numbers 

We turn to a major strengthening of Szemeredi's theorem. Now the hypergraph 
sequence is given as Gn = appr(/c,n) for fixed fc G N, where the vertex set of 
appr(fc,n) is the set of the first n prime numbers, while the hyperedges are the 
arithmetic progressions of size k (within the first n prime numbers). As before, 
every set of prime numbers having at most two elements is an arithmetic pro- 
gression, but now the first arithmetic progression of size 3 is {3,5,7}, and the 
first arithmetic progression of size 4 is {5, 11, 17, 23}. Until 2004 even condition 
(i) was unknown, that is, whether the primes contain arbitrarily long arithmetic 
progressions, and only with Q not only condition (i) was proven, but even con- 
dition (iii) (the underlying preprint was a major contribution towards the Fields 
medal for Terence Tao in 2006). Actually, until today no other proof of property 
(i) is known than through property (iii)! In analogy to Definition ^ and as a 
special case of Definition ^, we define Green- Tao numbers ( "GT- numbers" ) . 

Definition 13. For a parameter tuple {ki, . . . , km) let the Green-Tao number 
grtjjj(fci, . . . , km) be defined as the smallest no G N such that for every n > 
no and every f : {pi, . . . ,Pn} — > {1, • • • ,1^}, where pi, . . . ,pn are the first n 
prime numbers, there exists some i G {1,...,to} such that f^^{i) contains an 
arithmetic progression of size ki . 

According to Definition ^ we speak of trivial GT-numbers, simple GT- 
numbers, core GT-numbers, transversal GT-numbers, extended core 
GT-numbers, and diagonal GT-numbers. 

Theorem ^ applied to Green-Tao numbers, using Green- Tao's theorem (^), 
yields that extended GT-numbers grow linearly. For the explicit statement, as 
for Corollary |l^ and as a special case of Definition |^, we use cr'^p{t,q) for pa- 
rameter tuples t, using the hypergraph sequences in the primes belonging to the 
progression sizes in t. 

Corollary 14. For a parameter tuple t of length I G N, for m G No and for 
s G M>i we have grt„^;((2, . . . , 2); t) < max(s • to -t- 1, crPp(i, 1 — i)). 
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3.3 Remarks on transversal numbers and transversal extensions 



Given a nontrivial monotonic hypergraph sequence G, the (computational) deter- 
mination of the simplest transversal extension numbers Ni_|_m(G, Q^, . . . , Q^) = 
N2(G, (5™+^) (recall Theorem |lO|), with the special cases vdwm+i(2, . . . , 2, fc) 
and grt„j^;^(2, . . . , 2, fc), is relatively(!) easy, since essentially we have to com- 
pute the transversal numbers T{Gn) of the hypergraphs G„ (though still an NP- 
complcte task in general), that is the minimum size of a set of vertices having 
non-empty intersection with every hyperedge. This is also the motivation for the 
notion of "transversal iV- number" and "transversal extension": N2(G, Q™+^) is 
the smallest n such that T{Gn) > rn. The complements of independent sets in a 
hypergraph G are exactly the transversals of G, and thus r(G) + a{G) — \V{G)\ 
holds. So determination of the transversal numbers for the hypergraph sequence 
G determines the convergence rate w.r.t. the Szemeredi property, and is therefore 
of strong interest (recall Lemma ||). 

Considering the computation of r(G,i) for the vdW- and the GT-sequence 
of hypergraphs, going from G„ to G„+i only one vertex is added, and thus we 
have a relatively slow growth of complexity compared to transversal extensions 
of core tuples, without a clear boundary of what becomes "infeasible" . These 
problems also require some special treatment (using cardinality constraints or 
special hypergraph transversal algorithms). So we put the results on transversal 
vdW- or GT- numbers only into the appendix (see Section ^), where we used 
the most direct method for computing transversal numbers of hypergraphs via 
SAT (see the introduction to Section ^) . 



Special methods are applicable regarding the transversal numbers of vdW- 
hypergraphs, which exploit the translation invariance of arithmetic progres- 
sions; see p7| , p8[ for the basic ideas, and see the case fc = 3 in Subsection 
A.l for data derived by such special methods. even found for "small" m 
a precise formula for vdw„i+i(2, . . . , 2, fc). 

Regarding grt^_|_;^(2, . . . , 2, fc), such a (simple) formula likely does not exist, 
and also we loose translation invariance of the arithmetic progressions (since 
they must lie in the primes) , so computing the minimum size of hypergraph 
transversals via SAT solving seems a good option, but still should (and can) 
exploit special properties (not investigated in this paper). 
It seems that combining these special methods with SAT solving should yield 
the best results. 



Finally we mention that also transversal extensions of core tuples can be trans- 
lated into SAT problems by combining the general translation methods of the 
following section with cardinality constraints (which take care of the initial tu- 
ples of 2's). In this article we concentrated on the foundations and on the study of 
the various general translation schemes, so also the investigations of this special 
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treatment had to be postponed (that is, the GT-numbers for transversal ex- 
tensions reported in Section ^ have been obtained by just applying the general 
translations of non-boolean problems into boolean problems) . 

4 The generic translation scheme from non-boolean 
clause-sets to boolean clause-sets 

GT-problems of the form "grt2(fci,fc2) > n ?" have a natural formulation as 
(boolean) SAT problems by just excluding the arithmetic progressions of sizes 
ki and ^2, e.g. the problem "grt2(2,3) > 4 ?" yields the (satisfiable) clause- 
set {{2, 3}, {2, 5}, {2, 7}, {3, 5}, {3, 7}, {5, 7}, {-3, -5, -7}} over the variable- 
set {2, 3, 5, 7} (thus the answer is "yes"). A natural translation for arbitrary m 
is given when using generalised clause-sets as systematically studied in [ |r7|JT9|pO| , 
which allow variables v with finite domains and literals of the form "w ^ e" 
for values e e ■ The problem of colouring a hypergraph G with m colours is 
naturally translated into a SAT problem for generalised clause-sets via using m 
clauses for every hyperedge H G E{G), namely for every value e G {1, . . . , m} the 
clause {w 7^ £ : w £ H}, stating that not all vertices in H can have value e (note 
that the vertices of G are used as variables with (uniform) domain {1, . . . , m}). 
Accordingly we arrive at the natural generalisation F^^"^ f, (n) of the boolean 
formulation, using as variables the first n prime numbers, each with domain 
{!,..., m}, where the clauses are obtained from the hyperedges of appr(fci, n) 
for i e {1, . . . , m} by using literals "u ^ i" . 

As a running example consider m ^ 3, ki = k2 = = 3 and n = 5. 
We remark that we have grt3(3) = 137, as can be seen in Section ^. Only one 
hypergraph needs to be considered here (since all fci-values coincide), namely 
appr(3, 5) — ({2, 3, 5, 7, 11}, {{3, 5, 7}, {3, 7, 11}})- Now the (non-boolean) clause- 
set 1^3^3(5) uses the five (formaQ variables 2,3,5,7,11, each with domain 
{1, 2, 3}, while we have 3-2 = 6 clauses (each of length 3), namely the clauses 
{(3, i), (5, z), (7, i)}, {(3, i), (7, ^), (11, i)} for i e {1, 2, 3}. 

In the nested translation from generalised clause-sets to boolean clause- 
sets was introduced, while the generalisation to the generic translation scheme is 
outlined in [^ . Given a generalised clause-set F, for every variable an (arbitrary) 
unsatisflable boolean clause-set T{v) is chosen, such that for different variables 
these clause-sets are variable-disjoint. Furthermore for every value e £ Dy a 
necessary clause 71,(2) G T{v) is chosen (that is, T(w) \ {71, (e)} is satisfiable), 
such that to different values different clauses are assigned. Now the translation 
Tj{F) of F under T and 7 replaces for every clause C E F the (non-boolean) 



note that variable 2 does not occur here; it occurs only for ki — 2, and one could 
ignore it in general, however then we always had to use the offset 1 when comparing 
with prime number tables 
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literals v ^ e hy the (boolean) literals in clause 7t,(£), and adds for every variable 
V e var(i^) the clauses of the (boolean) clause-set T{v) \ {"fvis) ■ £ G -Dt,}. The 
clauses 7i,(e) are called the main clauses of T(w), while the other clauses of T{v) 
constitute the remainder. 

Lemma 15. T^{F) is satisfiability- equivalent to F. 

Proof. If is a satisfying assignment for F, then for every variable v € var((/j) 
choose a satisfying assignment ipv of T(v) \ {'Jvi'fiiv))}, and the union of these 
(compatible) assignments "011 yields a satisfying assignment for Tj (F) (here it is 
used that for e & Dy\ {(p{v)} we have 7«(e) ^ -fy{(p{v))). If on the other hand 
is a satisfying (total) assignment for Tj{F), then for every clause-set T{v) 
there exists some & Dy such that the clause 7t,(e) is falsified by V'; now the 
assignment v ^ satisfies F. □ 

The seven instances of the generic scheme used in this paper, where the domain 
of variable uis{l,...,m}, and where the boolean variables are for appropriate 
indices i, are as follows: 

1. T{v) — Dm '■— { {vi}, . . . , {vm}, {vT, ■ ■ ■ , } } with TTi variables is used for 
the weak direct translation, where jv{i) '■= {vi}- F>m is a marginal minimally 
unsatisfiable clause-set|^ with deficiency 1 (that is, with m + 1 clauses). The 
strong direct translation uses T{v) = := D„iU {{vi,Vj} : I <i < j < m} 
and the same 7u-0 

2. The weak reduced translation uses m — 1 variables with T(v) — Dm~i and 
an arbitrary bijection j.^ (note that D,n-i has m clauses), while the strong 
reduced translation uses the same 7„ and T[v) — D'^_-^. Different from 
the direct translations, here 71, plays a role now, namely the question is to 
which value one associates the long clause {TTT, . . . , Vm^i}, and so we have 
m (essentially) different choices. 

Note that clause-set Dm-i can be obtained from Dm by DP-reduction for 
variable Vm (replacing all clauses containing variable Vm by their resolvents 
on w,„), and accordingly from a clause-set translated by the (weak/strong) 
direct translation we obtain the clause-set translated by the (weak/strong) 
reduced translation by performing DP-reduction on all such variables (us- 
ing that the remainder- clauses are just used as they are, without additional 
literals in them). 



See [13 1 for an overview on minimally unsatisfiable clause-sets. 

In | |25| ] the "strong direct translation" is called "direct encoding" , starting from 
arbitrary CSP-problems (while we start from generalised clause-sets). We prefer to 
distinguish between "encodings", which are about variables and the mapping of 
assignments, and "translations", which concern the whole process, and which can 
use quite different but semantically equivalent clause-sets for example. For the direct 
translation it seems that always the strong form is better, but this is not the case for 
other translations, and so we explicitely distinguish between "weak" and "strong". 
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3. The weak nested translation uses m — 1 variables and T{v) = Hm-i, where 

Hm ■■= { {Vl}, {V1,V2}, . . . , {vT, ■ ■ ■,v:^,V,n}, • • ■ , W^} }, 

using some arbitrary bijection 7„ (note that Hm has deficiency 1, and thus 
Hm-i has m clauses). Hm is up to isomorphism the unique saturated mini- 
mally unsatisfiable Horn clause-set with m variables, and in fact is a satura- 
tion of the minimally unsatisfiable clause-set Dm (see ) . The strong nested 
translation uses the same 7t,, and, similar to the strong direct translation, 
T{v) = := Hm-i U {{vi, Vj} : 1 < i < j < m — 1} . For both forms now 

we have ml/2 (essentially) different choices for 7^, (note that only the two 
clauses of length m in iJ,„ can be mapped to each other by an isomorphism 
of Hjn)- The motivation for the introduction of the weak nested transla- 
tion in was that first the number of clauses is not changed by the 
translation, that is, T{v) is minimally unsatisfiable (also Dm-i fulfils this), 
and second that T{v) is a hitting clause-set, that is, every pair of different 
clauses clashes in at least one variable. These two requirements ensure that 
the conflict structure of the original (non-boolean) clause-set is preserved by 
the (boolean) translation. Instead of using Hm-i one could actually use any 
unsatisfiable hitting clause-set with m clauses here. 

4. The simple logarithmic translation^ considers the smallest natural number 
p with 2P > m, and sets T{v) ~ Ap, where Ap consists of all 2^ full clauses 
over variables Vi, . . . ,Vp, while 7„ is an arbitrary injection.'^ 



With the exception of the direct translation, which is fully symmetric in the 
clauses 7«(e), one has to decide about the choice of necessary clauses. With 
the exception of the simple logarithmic translation this is the choice of a suitable 
bijection, i.e., a question of ordering the values of the variables. In this initial 
study we have chosen a "standard ordering" , with the aim of minimising the 
size of the clause-set, by simply assigning the larger clauses to the larger de- 
values (since the larger the size of arithmetic progressions the fewer there are). 
Considering our running example 3(5) we obtain the following 7 translations: 

1. For the direct encoding we get 5 ■ 3 = 15 boolean variables Vp,i for p G 
{2,3,5,7} and i G {1,2,3}. The clause {(3, i), (5, i), (7, i)} is replaced by 
{v3,i,V5^i,vj,i} for i G {1,2,3}, while clause {(3, «), (7, i), (11, i)} is replaced 
by {^3,i: "fy.ij vii,i}- For the weak translation we have the 5 additional clauses 

®^ called the "log encoding" (for CSP-problems) in 

°' If 2^ = m, then there is (essentially) only one choice for 7„, however otherwise 
the situation is more complicated, and also resolutions are possible between the 
remaining clauses, shortening these clauses, and these shortened clauses can be used 
to shorten the main clauses. Therefore we speak of the "simple" translation, and 
further investigations are needed to find stronger schemes when m < 2^ . 
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{v^,v^,v^} for p G {2, 3, 5, 7, 11}, while for the strong translation addi- 
tionally we have the 5- (2) = 15 binary clauses {vp,j, Vpj} for p e {2, 3, 5, 7, 11} 
and i,j E {1, 2, 3), i < j. 

2. For the reduced encoding we get 5 • 2 = 10 boolean variables Vp^i for p G 
{2,3,5,7,11} and i e {1,2}. The clause {(3,i), (5,«), (7,i)} is replaced by 
{u3.«,W5,i,W7,i} for i e {1,2} rcsp. by {vsj, vs^, tJgJ, v^, vTJ, Wa) for i = 3, 
while clause {(3,i), (7, «), (11, i)} is replaced by {t'3,i, 1^7,1, ^^iiji} for « G {1,2} 
resp. by {v^, v^^, Wa,W^, ^11,2} for « = 3. For the weak translation 
there are no additional clauses, while for the strong translation we have 
5 ■ (2) = 5 additional binary clauses {up,i, Wp_2} for p E {2, 3, 5, 7, 11}. 
Note that due to our standardisation scheme the long replacement-clause is 
uniformly used for i = 3, while actually for each of the five (non-boolean) 
variables 2, 3, 5, 7, 11 one could use a different i G {1, 2, 3}. 

3. For the nested encoding we also get 5 • 2 = 10 boolean variables Vp^i for 
p G {2,3,5,7, 11} and i G {1,2}. The clause {(3,i), (5,i), (7,i)} is replaced 
for i = 1,2,3 by respectively {v3^i,V5,i,vr,i}, {WJ, "3,2,?^, W5, 2,^74, «7,2}, 
{WJ,Wl,WJ,V5^,V7j,Wa}^ while clause {(3,i), (7,i), (11,«)} for i = 1,2,3 
is replaced by respectively {t^s,!, i'7,i) ''^11,1}) {v3?i,V3,2,W^,V7,2,vTTJ,vii^2}, 
{WJ, W^, WJ, W^: I'll,! ) ^'11,2}- For the weak translation there are no addi- 
tional clauses, while for the strong translation we have 5 ■ (2) =5 additional 
binary clauses {vp^i^Vp^^} for p G {2, 3, 5, 7, 11}. 

Note (again) that due to our standardisation scheme the order of the three 
replacement-clauses is fixed for each variable, while for each variable one 
could use one of the 3! = 6 possible orders. 

4. Finally, for the logarithmic encoding we get (again, but here this is just 
an exception) 5 • 2 = 10 boolean variables Vp^i for p G {2,3,5,7,11} and 
i E {1,2}. We use the order A2 = { {ui, W2}, {i^, "2}, {iJT, t^}, {ui, W2} }, 
where the first three clauses are used for the values i = 1,2,3. Then the clause 
{(3, i), (5, i), (7, i)} is replaced for i = 1, 2, 3 by {ws.ii ^"3,2, W5,i> f^5,2, ^7,1) ^7,2}, 
{Wa, w3_2 , W5,2 , WJ, ■^7,2}, {WJ, Wa^ WJ, vT^} respectively, and 
{(3, j), (7, i), (11,?)} is replaced resp. by {w3,i> '^'3,2, W7,i' "^7,2, vii.i, i'ii,2}, 
{^^,^'3,2,WJ,^'7,2,wTM,^'ll,2}, {i^,i^, WJ, ^^2, WTJ, wn^}- Additional- 
ly we have the 5 clauses {vp^i,v^} for p G {2, 3, 5, 7, 11}. 



Somewhat surprisingly, in many cases considered in this paper the weak 
nested translation turned out to be best (from the above 7 translations consid- 
ered), for all three types of solvers, look-ahead, conflict-driven and local-search 
solvers (where for the latter an appropriate algorithm has to be chosen). Only 
for larger number of colours is the logarithmic translation superior (for local 
search, with various best algorithms; complete solvers were not successful on 
any of these instances (with larger number of colours)), while in all cases the 
weak nested translation was superior over the direct translation (weak or strong, 
for all solver types). 
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5 Computing Green- Tao numbers 



For trivial GT- numbers as with vdW- numbers we have grtj„(2) = m+1. However 
the simple GT-numbers are non-trivial: grt]^(fc) is the smallest n such that the 
first n prime numbers contain an arithmetic progression of size k. Only the values 
for 2 < fc < 21 are known, given by the sequence 2, 4, 9, 10, 37, 155, 263, 289, 316, 
21'966, 23'060, 58'464, 2'253'121, 9'686'320, 11'015'837, 227'225'515, 755'752'809, 
3'466'256'932,22'009'064'470,220'525'414'079.[3lt seems likely that considera- 
tion of GT-numbers for core tuples involving fc > 11 is infeasible (since the first 
21966 prime numbers need to be considered just to see the first progression of 
size 11). 

Considering the generalised clause-sets F — F^'^ j, (n) (recall Section 
the number n{F) of (formal) variables is n, while the number c{F) of clauses is 



Si" il^(^Ppr(^i:'^))l-[f^ So to compute the number of clauses in F, we have to 
compute how many arithmetic progressions of size fc there are for a given n. In 
other words, what can be said about the number |i?(appr(fc, n)) \ of hyperedges in 
the GT-hypergraphs? Exploiting the famous (unproven) "m-tuples conjecture" 
of Hardy-Littlewood, various asymptotic formulas (where the quotient with the 
true value is approaching 1 with n going to infinity) are given in . Translated 
into our context, where we rank the primes, for arbitrary N £ No the formula (7) 
from yields, using x := n ■ logn (which is an asymptotically precise formula 
to translate from the rank n to the associate n-th prime number p„): 



|i?(app,(fc,n))|^Cfe.— — ^.(1 + 5] 



N 



(logx)'' ^ (logx)' 

(for fixed fc, proven meanwhile for fc < 4; precise formulas for C'k and the are 
also given in |[To||). Just using linear regression to determine Ck and the Uk^i, using 
N = 2, yields very good approximations over the ranges we are considering. 

Solvers used are the algorithms from the Ubcsat local-search suite (jsH), 
minisat2 (0) for conflict-driven solvers (on our instances either minisat2 
was superior or not much worse than all other publicly available conflict-driven 
solvers, and thus it seems that the optimisations applied to minisat2 in other 
solvers don't improve performance on our instances), and OKsolver-2002 (p^), 
march_pl ([^) and satz (|2^) for look-ahead solvers. In one (largest) case 
survey propagation (§]) was successful (with 708206 clauses of length 5). If 
not stated otherwise, for all non-boolean cases the weak (standard) nested trans- 
lation is best (considering complete and incomplete solvers), and if not otherwise 



11) 



This data is available at tittp : //users . cybercity.dk/~dsl522332/math/ 



aprecords.htm, in the form of the prime numbers themselves, not their indiecs (as 
used by us), and so we needed to rank the prime numbers there. 
It seems that for n not much smaller than grt^(A;i, . . . , km) all variables actually 
occur with the exception of prime number 2, which occurs iff some ki = 2 exists. 
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stated, for lower bounds rnovelty+ is best. Recall that a lower bound stated as 
"> n" means that we conjecture that actually equality holds. 

We were able to compute five core GT-numbers, for 3 core numbers we have 
reasonable conjectures, and for 9 core numbers we have hopefully not unrea- 
sonable lower bounds. Furthermore we were able to compute 12 extended core 
GT-numbers, while for 16 cases we have conjectures. Transversal GT-numbers 



are presented in Section A. 2; see Subsection 3.3 for general remarks. 



1. 4 binary core GT-numbers grt2(a, b) are known: 





3 


4 


5 


6 7 


3 


23 


79 


528 


> 2072 > 13800 


4 




512 


> 4231 




5 






> 34309 





For (5, 5) we experienced the only case where survey propagation was 
successful (converging for n < 34309, diverging for n > 34309). For the 
other lower bounds adaptnovelty+ is best. OKsolver-2002 is best for 
(4,4), while for (3,5) minisat2 is best, followed by march_pl. 

2. One ternary core GT-number grt3(a, b, c) is known: 



1, c 

a, b 


3 


4 5 


3,3 


137 


> 434 > 1989 


3,4 




> 1662 > 8300 


4,4 




> 5044 



For (3, 3, 3) the logarithmic translation performed best, with minisat2 
fastest, followed by OKsolver-2002. For (3, 4, 5) rnovelty performed best. 



a, 5, c, d) 


is known: 


a, b, c 


3 4 


3,3,3 


> 384 > 1052 


3,3,4 


- > 2750 



4. Extending (3, 3) by m 2's, i.e., the numbers grt^^jl^, • ■ • , 2, 3, 3): 



to|| 1 1 


2 


3 


4 


5 


6 


7 8 


9 


10 


11 


12 


13 


14 


||23|31 


39 


41 


47 


53 


55 


> 60 > 62 


> 67 


> 71 


> 73 


> 82 


> 83 


> 86 



minisat2 is the best complete solver here (also for the other (complete) 
cases below). For the lower bounds the logarithmic translation is best, with 
rsaps except for m = 13 where walksat-tabu without null-flips is best. 



5. Extending (3, k) for fc > 4 by m 2's, i.e., the numbers grt„ , 2(2, . . . , 2, 3, k): 
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k 





1 


2 


3 


4 5 


6 


4 


79 


117 


120 


128 


136 > 142 


> 151 


5 


528 


581 


> 582 


> 610 







For fc = 5, TO = 2 saps is best, and for to = 3 walksat. For fc = 4, to = 6 
walksat-tabu with the logarithmic translation is best. 

6. Extending (4, k) by to 2's, i.e., the numbers grt,„+2(2, ■ • . , 2, 4, fc): 



k ™ 





1 


2 


4 


512 


> 553 


> 588 



sapsnr is best (for the fower bounds). 
7. Extending (3, 3, k) by m 2's, i.e., the numbers grt„_,_3(2, . . . , 2, 3, 3, fc): 



k ™ 





1 


2 


3 


137 


151 


> 154 


4 


> 434 


> 453 


> 471 



Some final remarks: 

1. For vdW- numbers. Conjecture |^ generahsed to all core tuples says that in 
every row of a table of core numbers (that is, in a core tuple one component 
grows while the others are fixed) we would have growth-rates 0{'n^) (where 
k depends on the row). Now for GT- numbers a first guess is that we have 
growth-rates 0(exp(n'^)). 

2. For vdW-numbers, in j2^. Research Problem 2.8.6, it is conjectured that 
vdw2(fc,fc) > vdw2(fc--l,fc + l) > vdw2(fc-2,fc + 2) > ••• > vdw2(2, 2fc-2). 
The sequences N{k, k),N{k - l,k + I), . . . , N{2, 2k + 2) can for N = vdw2() 
be reasonably evaluated for 2 < fc < 6, yielding the sequences 

(3), (9, 7), (35, 22, 11), (178, 73, 46, 15), (1132, ?, 146, 77, 19), 

which supports the conjecture. For GT- numbers (that is, N — grt2()) we 
can reasonably evaluate 2 < fc < 4, obtaining the sequences 

(3), (23, 14), (79, 528, 55), 

and we see that now we have a more complicated behaviour. 

6 Open problems and outlook 



Regarding the generic translation scheme, further extensive experimentation 
is needed w.r.t. the problem of ordering the values and of mixing translation 
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schemes (recall that every variable can be treated on its own). Also further in- 
stances of the generic scheme need to considered, starting with refining the log- 
arithmic translation when the number of values is not a power of 2. Of course, 
finally some form of understanding needs to be established, and we hope that 
the generic scheme offers a suitable environment for such considerations. 



As mentioned in Subsection 3.S , the translation of transversal extension prob- 
lems into boolean SAT can use cardinality constraints, and this needs to be ex- 
plored systematically. This includes the special case of transversal extensions of 
simple tuples, which is basically the hypergraph transversal problem (for these 
special hypergraphs) . 

A fundamental problem is to improve performance on unsatisfiable instances 
(of complete solvers) . The most promising general approach seems to us to sys- 
tematically study the optimisation of heuristics as outlined in . Investigating 
the tree-resolution and full-resolution complexity of these instances should be 
of great interest; we noticed that especially with the OKsolver-2002 the search 
trees show remarkable regularities (of a number-theoretical touch, in a kind of 
"fractal" way) . Exploiting the monotone nature of the hypergraph sequences of 
vdW- or GT-hypergraphs seems also necessary to reach the next level of vdW- 
or GT-numbers (regarding core parameter tuples), where some first (sporadic) 



methods one finds in 14 



In general, it seems to us that instances from Ramsey theory, like vdW- 
instances or GT-instances as considered in this paper, or like the Ramsey- 
instances (and there are many other families) , provide very good benchmarks for 
SAT solvers, combining the power of systematic creation as for random instances 
with various types of "structures" , where the interplay between these structures 
and SAT solving should be of great interest and potential. 
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A Transversal numbers 

A generic way of computing transversal vdW-numbers and transversal GT- 
numbers via SAT-solvers is as follows, using N G {vdw(), grt()} and respectively 
Gk{n) = ap(/c,n) or Gfe(n) = appr(A:,n). 

— Let Tk{n) := T(Gk{n)). 

— Recall that Nm+i{'2^, . . . ,2,k) is the smallest n with Tk{n) > m. 

— So we compute the numbers Tk{n) for n = 1,2,... as far as we get, and 
derive from these transversal numbers the transversal A'^-numbers. 

— We start with rfc(l) = (for A; > 1), and we know 

Tfc(n + 1) e {Tkin),Tkin) + 1}. 

— So for computing T^in -\- 1) we consider the satisfiability problem 

Tk{n+l) = b7 

for b := Tk{n): If this problem is satisfiable (a transversal for Gk{n-\-l) of size 
b exists), then we have Tj;(n-|-1) = b, while otherwise we have rfc(n-l-l) — 6-1-1. 

— To formulate the satisfiability problem as a (boolean) CNF-SAT-problem, we 
consider the vertices of Gk{n-\-l) as variables vi, . . . , w„+i and the hyperedges 
as positive clauses, and we add clauses expressing the cardinality constraint 
"di -I- ■ • • -I- w„_|_i = 5" (considering Vi G {0, 1} C No). 
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The best combination of SAT-solver and cardinality-constraint-translation we 
found uses minisat2 and binary addition. Except for vdwm-|_i(2, . . . , 2, fc) all 
numbers are computed in this way. 



A.l Transversal vdW- numbers 



Now we present the transversal vdW-numbers vdwm+i(2, . . . , 2, fc) we have com- 
puted. Numbers in boldface are given by the (precise) formula in [p2| , while for 
underlined numbers the formula still holds though the m-value is not in the 
domain of proven correctness. 

1. The known numbers vdw„j+i(2, . . . , 2, 3), with m = a ■ 10 + b: 








1 


2 


3 


4 


5 


6 


7 


8 9 





3 


6 


7 


8 


10 


12 


15 


16 


17 18 


1 


19 


21 


22 


23 


25 


27 


28 


29 


31 33 


2 


34 


35 


37 


38 


39 


42 


43 


44 


45 46 


3 


47 


48 


49 


50 


52 


53 


55 


56 


57 59 


4 


60 


61 


62 


64 


65 


66 


67 


68 


69 70 


5 


72 


73 


75 


76 


77 


78 


79 


80 


81 83 


6 


85 


86 


87 


88 


89 


90 


91 


93 


94 96 


7 


97 


98 


99 


101 102 103 105 106 107 108 


8 


109 110 112 


113 115 116 117 118 119 120 


9 


123 124 125 


126 127 128 129 130 131 132 


10 


133 134 135 


136 138 139 140 141 142 143 


11 


144 146 147 


148 149 151 152 153 154 155 


12 


156 158 159 


160 161 162 164 166 167 168 


13 


170 171 172 


173 175 176 177 178 179 180 


14 


181 182 183 


184 185 186 187 188 189 190 


15 


191 192 193 















Here via SAT solving only up to r(ap(3; 101)) 
could be computed (yielding vdw74_|_i(2, . . . , 2, 3) 
rn > 74 has been collected by Jarek Wroblewski at tittp: //www. math. uni. 



74, T(ap(3,102)) = 75 
102), while the data for 



wroc.pl/~jwr/non-ave.htn, using special methods. The data is available 
in the form of an "a-steplist" , that is, for index i ~ 1, 2, 3, . . . the smallest 
fli = ri G N with a(ap(3, n)) ~ i is given. Jarek Wroblewski conjectures that 
fli < i^'^- This data for i = 6 • 10 -I- c is given by the following table. 



1 



6 7 



9 10 



1 2 4 5 9 11 13 14 20 24 
26 30 32 36 40 41 51 54 58 63 
71 74 82 84 92 95 100 104 111 114 
121 122 137 145 150 157 163 165 169 174 
194 
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We would get such sequence of these numbers also directly from the runs 
of the SAT-solvers (as discussed above, however only as far as we get), by 
collecting all the n for which a satisfiable instance was obtained (which 
means that the transversal number didn't change, which is equivalent to the 
independence numbers making a step (of +1)). 
2. The known numbers vdwm+i(2, . . . , 2, 4), with m = a - 10 + b: 





0123456789 





4 7 11 12 14 16 18 20 22 24 


1 


26 29 31 32 35 36 38 39 41 42 


2 


44 46 47 49 51 52 55 56 57 59 


3 


61 62 63 65 67 69 71 72 73 75 


4 


76 78 80 



3. The known numbers vdwTO+i(2, . . . , 2, 5), with m = a ■ 10 + 6: 





012345 6 7 8 


9 





5 10 15 20 21 22 23 26 30 


32 


1 


35 40 45 46 47 48 50 53 55 


60 


2 


65 70 71 72 73 74 75 80 85 


90 


3 


95 96 97 98 99 100 101 102 103 





4. The known numbers vdwm+i(2, ...,2,6), with m = a • 10 + 6: 





0123456789 





6 11 16 21 27 28 30 31 34 38 


1 


42 43 47 52 53 55 57 60 63 67 


2 


69 72 77 78 79 81 84 



5. The known numbers vdwm_|_i(2, . . . , 2, 7), with m = a • 10 + 6: 





1 2 3 4 5 6 


7 8 9 





7 14 21 28 35 42 43 


44 45 47 


1 


49 54 58 62 66 70 77 


84 91 92 


2 


93 94 96 97 99 105 108 





6. The known numbers vdwm+i(2, ...,2,8), with m = o • 10 + 6: 





1 2 


3 4 5 6 7 8 9 





8 15 22 


29 36 43 51 52 53 55 


1 


57 60 64 


70 73 79 81 86 93 100 


2 


101 102 103 





7. The known numbers vdwm+i(2, . . . , 2, 9), with m = a ■ 10 + 6: 



23 








12345678 9 





9 


18 25 32 39 46 53 58 59 62 


1 


66 


72 74 77 81 87 91 97 102 106 


2 


110 





8. The known numbers vdwm+i(2, . . . , 2, 10), with m = a ■ 10 + 6: 



a b 


1 2 3 4 .") () 7 8 


9 





10 19 29 34 41 48 55 62 65 


69 


1 


74 79 85 89 92 96 101 106 110 





9. The known numbers vdwm+i(2, . . . , 2, 11), with m = o • 10 + 6: 



1 



7 8 



II 22 33 44 55 66 77 88 99 110 

III 112 113 114 116 118 119 121 129 



10. The known numbers vdwm+i(2, . . . , 2, 12), with m = a ■ 10 + b: 



1 



9 



4 



(i 



8 



12 23 34 45 56 67 78 89 100 111 

123 124 125 126 127 129 130 133 



11. The known numbers vdwm+i(2, . . . , 2, 13), with m = a • 10 + 6: 







5 6 



13 26 39 52 65 78 91 104 117 130 

143 156 157 158 159 160 162 163 



12. The known numbers vdwm+i(2, ■ . ■ , 2, 14), with to = a • 10 + 6: 



1 



6 7 



14 27 40 53 66 79 92 105 118 131 

144 157 171 172 173 174 175 176 



13. The known numbers vdw„i_|_i(2, . . . , 2, 15), with m 



1 



6 7 



a • 10 + 6: 

8 9 



15 30 43 56 69 82 95 108 121 134 

147 160 173 184 185 186 188 189 



14. The known numbers vdwm+i(2, .... 2, 16), with to = a • 10 + 6: 



1 



6 7 8 9 



16 31 47 58 71 84 97 110 123 136 
149 162 175 188 197 199 200 202 
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15. The known numbers vdwm+i(2, . . . , 2, 17), with m = a ■ 10 + b: 







1 



3 



7 8 9 



17 34 51 68 85 102 119 136 153 170 
187 204 221 238 255 272 273 274 275 276 

277 



16. The known numbers vdwTO+i(2, • . • , 2, 18), with m = a • 10 + 6: 



1 



18 35 52 69 86 103 120 137 154 171 
188 205 222 239 256 273 291 292 293 294 

295 



17. The known numbers vdwTO+i(2, • . • , 2, 19), with m = a ■ 10 + b: 



1 



19 38 57 76 95 114 133 152 171 190 
209 228 247 266 285 304 323 342 343 344 

345 346 



A. 2 Transversal GT-numbers 



1. The known numbers grt^_|_i(2, ...,2,3), with m = a • 10 + 6: 




1 
2 

3 
4 



0123456789 



4 7 9 13 14 16 18 21 22 23 
28 29 30 32 33 36 38 39 40 42 
43 47 48 49 50 52 55 56 57 59 
61 62 65 68 69 70 71 72 73 75 

76 78 80 81 



2. The known numbers grt^_|_i(2, ...,2,4), with m = a • 10 + 6: 



12 3 4 



G 7 8 9 



9 14 17 22 26 32 35 36 37 45 
46 51 56 58 61 62 71 72 73 78 
79 83 



3. The known numbers grt„^j(2, . . . , 2, 5), with m = a • 10 + 6: 





1 2 3 4 5 


6 7 8 9 





10 31 32 49 58 61 


62 78 87 98 


1 


107 112 121 123 142 143 





4. The known numbers grt,„^j(2, . . . , 2, 6), with m = a ■ 10 + b: 





12 3 4 


5 6 7 8 9 





37 55 64 71 90 


97 125 152 162 179 


1 


201 204 211 212 250 





5. The known numbers grt^_|_]^(2, . . . , 2, 7), with to = a • 10 + 6: 





1 2 3 4 5 6 


7 8 9 





155 214 228 232 323 396 570 


641 715 796 


1 


827 872 875 1048 1125 1158 1180 





6. The known numbers grt^_|_2(2, ...,2,8), with to = a • 10 + 6: 



a' 


12 3 


4 5 6 7 8 9 





263 349 665 789 


1323 1428 1447 1473 1555 1801 


1 


1881 1935 1979 2117 





